Diphone Speech Synthesis System for Arabic Using MARY TTS
نویسندگان
چکیده
Concatenative speech synthesis systems generate speech by concatenating small prerecorded speech units which are stored in the speech unit inventory. The most commonly used type of these units is the diphone which is a unit that starts at the middle of one phone and extends to the middle of the following one. Diphones have the advantage of modeling coarticulation by including the transition to the next phone inside the diphone itself. In this paper, a diphone speech synthesis system for the Arabic language using MARY TTS has been developed and evaluated by two types of tests which are the Diagnostic Rhyme Test (DRT) that measures the intelligibility of the synthesized speech and the Categorical Estimation (CE) test that measures the overall quality of the synthesized speech. The results of these tests are illustrated in the experiments and results section.
منابع مشابه
Estimating phone lengths for a diphone-based text-to-speech system for Arabic
We have described elsewhere a text-to-speech (TTS) system for Modern Standard Arabic which imposes a pitch contour on the output to indicate the force of the utterance (statement/query/command) and to mark emphasis (as specified by the use of non-canonical word orders). This TTS uses the diphone-based speech synthesiser Mbrola, for which you have to provide information about phone lengths. In t...
متن کاملIncreased Diphone Recognition for an Afrikaans TTS system
In this paper we discuss the implementation of an Afrikaans TTS system that is based on diphones. Using diphones makes the system flexible but presents other challenges. A previous effort to design an Afrikaans TTS system was done by SUN. They implemented a TTS system based on full words. A full word based TTS system produces more natural sounding speech than when the system is designed using o...
متن کاملDiphone-Based Concatenative Speech Synthesis System for Mongolian
This paper describes the first Text-to-Speech (TTS) system for the Mongolian language, using the general speech synthesis architecture of Festival. The TTS is based on diphone concatenative synthesis, applying TD-PSOLA technique. The conversion process from input text into acoustic waveform is performed in a number of steps consisting of functional components. Procedures and functions for the s...
متن کاملSpeech Data Analysis for Diphone Construction of a Maori Online Text-to-speech Synthesizer
One of the main types of speech processing technologies today is text-to-speech (TTS) synthesis. A well established speech synthesizer technique called ‘diphone concatenation’ uses a speakers processed speech examples to apply a more human-like response to the TTS synthesis system. This methodology has been used to construct many diphone databases for various languages, and was the basis for bu...
متن کاملImplementation and evaluation of a text-to-speech synthesis system for turkish
In this paper, a diphone based Text-to-Speech (TTS) system for the Turkish language is presented. Turkish is the official language of Turkey, where it is the native language of 70 million people and it is also widely spoken in Asia (Azerbaidjain, Uzbekhstan, Kazakhstan, Kirgizhstan and Iran), Cyprus and the Balkans. The research has been done through a visiting internship at CSLR (the Center fo...
متن کامل